Enhanced Evolutionary and Heuristic Algorithms for Haplotype Reconstruction Problem Using Minimum Error Correction Model

نویسندگان

  • Mehdi Kargar
  • Hadi Poormohammadi
  • Leila Pirhaji
  • Mehdi Sadeghi
  • Hamid Pezeshk
  • Changiz Eslahchi
چکیده

Construction of two haplotypes from a set of Single Nucleotide Polymorphism (SNP) fragments is referred to as haplotype reconstruction problem. One of the most important computational models for this problem is Minimum Error Correction (MEC). Since MEC is an NP-hard problem, here we propose a heuristic algorithm for haplotype reconstruction problem. The algorithm is Particle Swarm Optimization (PSO) which is an evolutionary algorithm (EA). Evolutionary algorithms are stochastic search algorithms that imitate the natural biological evolution or the social behavior of species. In contrast to MEC model, our algorithm produces results in feasible time and it could be applied to large datasets. Our results suggest that the algorithm has less reconstruction error rate compared to other algorithms. This error is also very close to zero when the algorithm is applied to actual biological data. A comprehensive comparison between PSO and four famous algorithms in the literature is presented. A discussion on input parameters influencing reconstruction error rate is also presented. * Corresponding author. E-mail address: [email protected] MATCH Communications in Mathematical and in Computer Chemistry MATCH Commun. Math. Comput. Chem. 62 (2009) 261-274

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FastHap: fast and accurate single individual haplotype reconstruction using fuzzy conflict graphs

MOTIVATION Understanding exact structure of an individual's haplotype plays a significant role in various fields of human genetics. Despite tremendous research effort in recent years, fast and accurate haplotype reconstruction remains as an active research topic, mainly owing to the computational challenges involved. Existing haplotype assembly algorithms focus primarily on improving accuracy o...

متن کامل

Haplotype assembly in polyploid genomes and identical by descent shared tracts

MOTIVATION Genome-wide haplotype reconstruction from sequence data, or haplotype assembly, is at the center of major challenges in molecular biology and life sciences. For complex eukaryotic organisms like humans, the genome is vast and the population samples are growing so rapidly that algorithms processing high-throughput sequencing data must scale favorably in terms of both accuracy and comp...

متن کامل

Consolidated Technique of Response Surface Methodology and Data Envelopment Analysis for setting the parameters of meta-heuristic algorithms - Case study: Production Scheduling Problem

    In this study, given the sequence dependent setup times, we attempt using the technique of Response Surface Methodology (RSM) to set the parameters of the genetic algorithm (GA), which is used to optimize the scheduling problem of n job on 1 machine (n/1). It aims at finding the most suitable parameters for increasing the efficiency of the proposed algorithm. At first, a central composite d...

متن کامل

Haplotype reconstruction from SNP fragments by minimum error correction

MOTIVATION Haplotype reconstruction based on aligned single nucleotide polymorphism (SNP) fragments is to infer a pair of haplotypes from localized polymorphism data gathered through short genome fragment assembly. An important computational model of this problem is the minimum error correction (MEC) model, which has been mentioned in several literatures. The model retrieves a pair of haplotype...

متن کامل

Using Harmony Clustering for Haplotype Reconstruction from SNP fragments

Single Nucleotide Polymorphisms (SNPs), a single DNA base varying from one individual to another, are believed to be the most frequent form responsible for genetic differences. Haplotypes have more information for disease-associating than individual SNPs or genotypes; it is substantially more difficult to determine haplotypes through experiments. Hence, computational methods that can reduce the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009